From Acceleration to Saturation: Scaling Behavior of Bootstrapped Language Model Pretraining
Bootstrapped pretraining, i.e., the reuse of a pretrained base model for further pretraining, such as continual pretraining or model growth, is a promising way to reduce the cost of training language models from scratch. However, its effectiveness remains unclear, especially when applied to overtrained base models. In this work, we empirically study the scaling behavior of bootstrapped pretraining and find that its scaling efficiency diminishes in a predictable manner: the scaling exponent with respect to second-stage pretraining tokens decreases logarithmically with the number of tokens used to pretrain the base model. The joint dependence on first- and second-stage tokens is accurately modeled by a simple scaling law. This saturation effect reveals a fundamental trade-off in multi-stage pretraining strategies: the more extensively a model is pretrained, the less additional benefit bootstrapping provides. Our findings provide practical insights for efficient language model training and raise important considerations for the reuse of overtrained models.
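The saturation described in the abstract can be sketched as a toy joint scaling law. The functional form and every constant below (`beta0`, `c`, `E`, `A`) are illustrative assumptions, not the paper's fitted values; the only property being illustrated is the logarithmic decay of the second-stage scaling exponent.

```python
import math

def stage2_exponent(d1, beta0=0.35, c=0.012):
    """Scaling exponent for second-stage tokens.

    Decays logarithmically in d1 (first-stage tokens), mimicking the
    saturation effect the abstract describes. beta0 and c are made up.
    """
    return max(beta0 - c * math.log(d1), 0.0)

def bootstrapped_loss(d1, d2, E=1.7, A=20.0):
    """Toy joint scaling law: L(d1, d2) = E + A * d2 ** (-beta(d1))."""
    return E + A * d2 ** (-stage2_exponent(d1))

# The longer the base model was pretrained, the flatter the second-stage curve:
for d1 in (1e9, 1e10, 1e11):
    print(f"first-stage tokens {d1:.0e} -> "
          f"second-stage exponent {stage2_exponent(d1):.3f}")
```

Under these assumptions, a base model pretrained on more tokens yields a smaller exponent, so additional second-stage tokens buy proportionally less loss reduction.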
- North America > United States (0.14)
- Europe > Austria > Vienna (0.14)
- Asia > Japan > Honshū > Kantō > Tokyo Metropolis Prefecture > Tokyo (0.14)
Optimal Growth Schedules for Batch Size and Learning Rate in SGD that Reduce SFO Complexity
Umeda, Hikaru, Iiduka, Hideaki
The unprecedented growth of deep learning models has enabled remarkable advances but introduced substantial computational bottlenecks. A key factor contributing to training efficiency is batch-size and learning-rate scheduling in stochastic gradient methods. However, naive scheduling of these hyperparameters can degrade optimization efficiency and compromise generalization. Motivated by recent theoretical insights, we investigated how the batch size and learning rate should be increased during training to balance efficiency and convergence. We analyzed this problem on the basis of stochastic first-order oracle (SFO) complexity, defined as the expected number of gradient evaluations needed to reach an $ε$-approximate stationary point of the empirical loss. We theoretically derived optimal growth schedules for the batch size and learning rate that reduce SFO complexity and validated them through extensive experiments. Our results offer both theoretical insights and practical guidelines for scalable and efficient large-batch training in deep learning.
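The idea of jointly growing the batch size and learning rate during training can be sketched with a simple geometric schedule. The paper derives specific optimal growth schedules; the stepwise doubling and all constants below are only a generic illustrative stand-in, not the authors' derived optima.

```python
def growth_schedule(step, b0=128, eta0=0.1, growth=2.0, interval=1000):
    """Toy schedule: every `interval` steps, multiply both the batch size
    and the learning rate by `growth`.

    Increasing them together follows the spirit of the abstract (both
    hyperparameters grow during training); the exact factors here are
    arbitrary illustrative choices.
    """
    k = step // interval          # how many growth events have occurred
    batch = int(b0 * growth ** k)
    lr = eta0 * growth ** k
    return batch, lr

# Example: inspect the schedule at a few points in training.
for step in (0, 1000, 2500):
    print(step, growth_schedule(step))
```

In practice such a schedule would be consulted once per step (or per epoch) inside the training loop to reconfigure the data loader and optimizer.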
VEGA: Learning Interleaved Image-Text Comprehension in Vision-Language Large Models
Zhou, Chenyu, Zhang, Mengdan, Chen, Peixian, Fu, Chaoyou, Shen, Yunhang, Zheng, Xiawu, Sun, Xing, Ji, Rongrong
The swift progress of Multi-modal Large Models (MLLMs) has showcased their impressive ability to tackle tasks blending vision and language. Yet, most current models and benchmarks cater to scenarios with a narrow scope of visual and textual contexts. These models often fall short when faced with complex comprehension tasks, which involve navigating through a plethora of irrelevant and potentially misleading information in both text and image forms. To bridge this gap, we introduce a new, more demanding task known as Interleaved Image-Text Comprehension (IITC). This task challenges models to discern and disregard superfluous elements in both images and text to accurately answer questions and to follow intricate instructions to pinpoint the relevant image. In support of this task, we further craft a new VEGA dataset, tailored for the IITC task on scientific content, and devise a subtask, Image-Text Association (ITA), to refine image-text correlation skills. Our evaluation of four leading closed-source models, as well as various open-source models using VEGA, underscores the rigorous nature of IITC. Even the most advanced models, such as Gemini-1.5-pro and GPT4V, only achieved modest success. By employing a multi-task, multi-scale post-training strategy, we have set a robust baseline for MLLMs on the IITC task, attaining an $85.8\%$ accuracy rate in image association and a $0.508$ Rouge score. These results validate the effectiveness of our dataset in improving MLLMs' capabilities for nuanced image-text comprehension.
Evaluation of ChatGPT on Biomedical Tasks: A Zero-Shot Comparison with Fine-Tuned Generative Transformers
Jahan, Israt, Laskar, Md Tahmid Rahman, Peng, Chun, Huang, Jimmy
ChatGPT is a large language model developed by OpenAI. Despite its impressive performance across various tasks, no prior work has investigated its capability in the biomedical domain yet. To this end, this paper aims to evaluate the performance of ChatGPT on various benchmark biomedical tasks, such as relation extraction, document classification, question answering, and summarization. To the best of our knowledge, this is the first work that conducts an extensive evaluation of ChatGPT in the biomedical domain. Interestingly, our evaluation finds that on biomedical datasets with smaller training sets, zero-shot ChatGPT even outperforms state-of-the-art fine-tuned generative transformer models, such as BioGPT and BioBART. This suggests that ChatGPT's pre-training on large text corpora makes it quite specialized even in the biomedical domain. Our findings demonstrate that ChatGPT has the potential to be a valuable tool for various tasks in the biomedical domain that lack large annotated data.
- North America > United States > Washington > King County > Seattle (0.04)
- North America > United States > Maryland > Montgomery County > Gaithersburg (0.04)
- North America > Canada > Ontario > Toronto (0.04)
Understanding Gaussian Elimination part3 (Machine Learning)
Abstract: The Gaussian Elimination with Partial Pivoting (GEPP) is a classical algorithm for solving systems of linear equations. Although in specific cases the loss of precision in GEPP due to roundoff errors can be very significant, empirical evidence strongly suggests that for a {\it typical} square coefficient matrix, GEPP is numerically stable. We obtain a (partial) theoretical justification of this phenomenon by showing that, given the random $n \times n$ standard Gaussian coefficient matrix $A$, the {\it growth factor} of the Gaussian Elimination with Partial Pivoting is at most polynomially large in $n$ with probability close to one. This implies that with probability close to one the number of bits of precision sufficient to solve $Ax = b$ to $m$ bits of accuracy using GEPP is $m + O(\log n)$, which improves an earlier estimate $m + O(\log^2 n)$ of Sankar, and which we conjecture to be optimal by the order of magnitude.
Abstract: Linear reversible circuits represent a subclass of reversible circuits with many applications in quantum computing.
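The growth-factor claim in the GEPP abstract above is easy to probe empirically. The sketch below implements elimination with partial pivoting in NumPy and reports the growth factor (largest entry magnitude seen during elimination, divided by the largest entry of the input); one random matrix is anecdotal evidence, of course, not a proof of the polynomial bound.

```python
import numpy as np

def gepp_growth_factor(A):
    """Growth factor of Gaussian elimination with partial pivoting (GEPP):
    the largest entry magnitude appearing in any intermediate matrix,
    divided by the largest entry magnitude of the input."""
    U = np.array(A, dtype=float)
    n = U.shape[0]
    g0 = np.abs(U).max()          # max |A|, the denominator
    g = g0
    for k in range(n - 1):
        p = k + np.argmax(np.abs(U[k:, k]))   # largest pivot in column k
        U[[k, p]] = U[[p, k]]                 # row swap (partial pivoting)
        U[k + 1:, k:] -= np.outer(U[k + 1:, k] / U[k, k], U[k, k:])
        g = max(g, np.abs(U).max())
    return g / g0

# A typical random standard Gaussian matrix shows only modest growth,
# consistent with the polynomial bound discussed above:
rng = np.random.default_rng(0)
A = rng.standard_normal((200, 200))
print(f"growth factor: {gepp_growth_factor(A):.2f}")
```

Running this for increasing $n$ and many seeds would give a rough empirical picture of how the typical growth factor scales with the matrix size.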
Machine Learning as a Service Market 2023 Demand, Growth, Technology Trends, and Forecasts by 2032
QMI Market Research published its latest Machine Learning as a Service Market 2032 study, with an in-depth analysis of the current scenario, market size, demand, growth pattern, trends, and forecast. This market research report was structured by an expert team through several steps of collecting and analyzing market data. The report highlights the key market dynamics of the sector and encompasses historical data, present market trends, the market environment, technological innovation, upcoming technologies, and technical progress in the related industry. Moreover, it also contains information on market definition, classifications, key developments, applications, and engagements, detailing the actions of key players with respect to product launches, joint ventures, developments, mergers, and acquisitions, and the effects of these actions on sales, imports, exports, revenue, and CAGR values. An excellent market research report can be generated only with leading attributes such as committed research and analysis, practical solutions, innovation, talent solutions, integrated approaches, up-to-date technology, and dedication.
- Europe (0.99)
- North America > United States (0.15)
- Asia > Middle East (0.15)
- Marketing (1.00)
- Banking & Finance > Trading (1.00)
Deep Learning Market: Growth Factors, Applications, Regional Analysis, Key Players and Forecasts by 2026 – The Think Curiouser
The Deep Learning market research study provides an all-inclusive assessment of the market, offering historical intelligence, actionable insights, and an industry-validated, statistically upheld market forecast. A verified and suitable set of assumptions and methodology has been leveraged in developing this comprehensive study. Information and analysis of key market segments incorporated in the report are delivered in weighted chapters. The global Deep Learning Market research report covers the historical, present, and future situation of market size and share, revenue, industry demand, and the growth prospects of the Deep Learning industry globally. The report presents all the important data and analysis of market advantages and disadvantages, the impact of Covid-19, revenue opportunities, and future industry scope in a clear manner.
Visualization and machine learning for forecasting of COVID-19 in Senegal
Ndiaye, Babacar Mbaye, Balde, Mouhamadou A. M. T., Seck, Diaraf
In this article, we present visualizations and several machine learning techniques for two-week and 40-day-ahead forecasts based on public data. On July 15, 2020, Senegal reopened its airspace, while the number of confirmed cases was still increasing. The population no longer respects hygiene measures and social distancing as it did at the beginning of the contamination. Negligence, or fatigue with always wearing masks? We also forecast the inflection point and the possible ending time.
- Africa > Senegal > Dakar Region > Dakar (0.06)
- North America > United States > New York (0.04)
- Europe > United Kingdom > England > Oxfordshire > Oxford (0.04)
- Health & Medicine > Therapeutic Area > Infections and Infectious Diseases (1.00)
- Health & Medicine > Therapeutic Area > Immunology (1.00)
- Health & Medicine > Epidemiology (1.00)
Future Fields is tackling cultured meat's biggest problem -- #ArtificialIntelligence #StartUp #iot #robotics #AI
One possible solution to cellular agriculture's biggest problem -- how to develop a cheap, humane growth medium for cultured meat -- may have come from a conversation in line at a Tim Hortons in Alberta. The husband-and-wife duo of Matt and Jalene Anderson-Baron were waiting for Timbits and coffee and talking about the technology behind their startup, Future Fields, when Jalene suggested a possible new growth medium. Matt Anderson-Baron had hit a wall in his research, and the pair, who represented two-thirds of the founding triumvirate of Future Fields, were out for a snack. Along with co-founder Lejjy Gafour, the three friends had set out to launch a startup from Canada that could do something about the world's reliance on animals for protein. They recognized that the problems associated with animal farming were unsustainable at the scale needed to meet global demand for meat.
- North America > Canada > Alberta (0.37)
- Asia > Singapore (0.05)